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Abstract. The Dirac 6 function has solid roots in 19th century 
work in Fourier analysis and singular integrals by Cauchy and oth- 
ers, anticipating Dirac's discovery by over a century, and illumi- 
nating the nature of Cauchy's infinitesimals and his infinitesimal 
definition of S. 
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1. DlRAC ON INFINITY 



The specialisation of the scientific disciplines since the 19th century 
has led to a schism between the scientists' pragmatic approaches (e.g., 
using infinitesimal arguments), on the one hand, and mathematicians' 
foundational preferences, on the other. The widening schism between 
mathematics and physics has been the subject of much soul-searching, 
see, e.g., S.P. Novikov [61J. 

In an era prior to such a schism, a seminal interaction between 
physics and mathematics was foreshadowed in a remarkable fashion, 
exploiting infinitesimals, in Cauchy's texts from 1827. Cauchy defined 
the so-called Dirac delta function in terms of infinitesimals, and applied 
it to the evaluation of singular integrals and in Fourier analysis. 

Let us first review P. Dirac's well-known discussion of 5. Dirac writes 
in his § 10: 

Consider now the single integral 



From the orthogonality theorem, the integrand here must 
vanish over the whole range of integration except the one 
point £" = [. ..] in general (£x\£'y) must be infin- 
itely great in such a way as to make (29) non-vanishing 
and finite. The form of infinity required for this will be 
discussed in § 15 (Dirac 1958, pH p. 39]). 

Note that Dirac explicitly speaks of "infinitely great" values of the 
integrand in his formula (29), and does not shy away from discussing 
"the form of infinity required" . Dirac resumes his discussion of S in his 
§ 15 in the following terms: 

Our work in § 10 led us to consider quantities involv- 
ing a certain kind of infinity. To get a precise notation 
for dealing with these infinities, we introduce a quan- 
tity S(x) depending on a parameter x satisfying the con- 
ditions 



(Dirac 1958, pH p. 58]). 

Dirac views his 5 as a way of "dealing with these infinities". Dirac 
is not content with merely providing such an algebraic definition, and 
proceeds to explain how one can "get a picture of 5(x)": 




(29) 
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take a function of the real variable x which vanishes 
everywhere except inside a small domain, of length e 
say, surrounding the origin x — 0, and which is so large 
inside this domain that its integral over this domain is 
unityE [• • ■ ] Then in the limit e — » this function will 
go over into 5(x) (ibid.). 

Here Dirac is using the expression "in the limit" in a generic sense@ 

Furthermore, 

The most important property of 5(x) is exemplified by 
the following equation, 

/oo 
f(x)5(x) = /(0), 
-oo 

where f(x) is any continuous function of x (Dirac [191 
p. 59]). 

The above equation is identical to one appearing in Cauchy a century 
earlier^ An additional intriguing formula appears in the context of 
Dirac's discussion of the discontinuity of the principal value of the log 
function: 

— logx = i7rS(x), (1.1) 

(XiOC oc 

accompanied by a comment to the effect that "this particular combi- 
nation of reciprocal function and 5 function plays an important role in 
the quantum theory of collision processes" (Dirac [T9, p. 61]). 

2. Modern Cauchy scholarship 

The earliest mention of Cauchy's 5 appears to be in H. Freudenthal. 

2.1. Freudenthal's biography. Cauchy's anticipations of 5 were al- 
luded to in 1971 by Freudenthal, who mentioned S functions twice in 
his Cauchy biography for the Dictionary of Scientific Biography [27]. 
They were the subject of scholarly attention by J. Liitzen (1982, [57] ) 
and D. Laugwitz (1989, [52J; 1992 [53]). A spate of recent articles have 

^Dirac does not specify a function, merely mentioning that the procedure works 
"provided there are no unnecessarily wild variations" . A century earlier Cauchy 
did specify a function (see formula (|3.2I) and footnote ITTT) . 

2 Dirac's "limit" can be interpreted mathematically either in terms of distribution 
theory a la Sobolev and Schwartz, or in the sense of an ultraproduct. 

3 See formula ([XT]) . 

4 The imaginary part of the principal value of log(a;) in (jl.lj) is for x > and in 
for x < 0. Thus, Im(log(a;)) = —iirH(x) where H(x) is the Heaviside function 
(which equals —1 for negative x and for positive x), whose derivative is S(x) in 
the sense discussed in Section^ 
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undertaken a re-evaluation of Cauchy's foundational contribution to 
the development of infinitesimal analysis: Brating (2007, [9]), Barany 
(2011, [1]), Blaszczyk et al. (2012, [5J), Borovik et al. (2011, [7]), Katz 
& Katz [UJ|39]. The present text will focus on Cauchy's anticipations 
of the Dirac delta. 

J. Liitzen [57] traces the origins of distribution theory (traditionally 
attributed to S. Sobolev and L. Schwartz), in 19th century work of 
Fourier, Kirchhoff, and Heaviside. An accessible introduction to this 
area may be found in J. Dieudonne [IB] . 

2.2. Dieudonne's review. Dieudonne is one of the most influential 
mathematicians of the 20th century. A fascinating glimpse into his 
philosophy is provided by his 1984 review [TBJ of J. Liitzen's book. At 
the outset, Dieudonne poses the key question: 

One [. . . ] may well wonder why it took more than 30 
years for distribution theory to be born, after the the- 
ory of integration had reached maturity (Dieudonne [TBI 
p. 374]). 

This remark inscribes itself in a tradition of a long-established and 
pervasive dogma, to the effect that 20th century physicists were the first 
to invent the delta function. Thus, M. Bunge, responding to Robinson's 
lecture The metaphysics of the calculus, evoked the physicist's custom 

to refer to the theory of distributions for the legaliza- 
tion of the various delta 'functions' which his physical 
intuition led him to introduce (Bunge in Robinson, 1967 
[651 P- 44-45]; [661 P- 553-554]). 

But did Dirac introduce the delta function? Laugwitz [521 p. 219] notes 

that probably the first appearance of the (Dirac) delta function is in 

the 1822 text by Fourier [21]. 

In his review of J. Liitzen's book for Math Reviews, F. Smithies 

notes: 

Chapter 4, on early uses of generalized functions, covers 
fundamental solutions of partial differential equations, 
Hadamard's "partie finie", and many early uses of the 
delta function and its derivatives, including various at- 
tempts to create a rigorous theory for them (Smithies, 
[72]). 

At the end of his review, Smithies mentions Cauchy: "In spite of 
the thoroughness of his coverage, [Liitzen] has missed one interesting 
event — A. L. Cauchy's anticipation of Hadamard's 'partie finie' ", but 
says not a word about Cauchy's infinitesimally-defined delta functions. 
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2.3. Gilain on limits. Cauchy's use of infinitesimals is not always 
reported accurately in the literature. Thus C. Gilain claims that 

On sait que Cauchy definissait le concept d'infiniment 
petit a l'aide du concept de limite, qui avait le premier 



[2EJ footnote 67]). 

Here Gilain is referring to Cauchy's Collected Works, Serie 2, Tome 3, 
p. 19, corresponding to (Cauchy 1821, [10J p. 4]). Both of Gilain's 
claims are erroneous, as we show below. Already in 1973, Hourya 
Benis Sinaceur warned: 

on dit trop rapidement que c'est Cauchy qui a introduit 
la 'methode des limites' entendant par la, plus ou moins 
vaguement, l'emploi systematique de V epsilonisation (Si- 
naceur 1973, [701 P- 108]). 

Sinaceur pointed out that Cauchy's definition of limit resembles, not 
that of Weierstrass, but rather that of Lacroix dating from 1810 (see 
Sinaceur [701 p. 109]). 

Cauchy's primary notion is that of variable quantities, introduced in 
the following terms: 

On nomme quantite variable celle que Ton considere 
comme devant regevoir successivement plusieurs valeurs 
differentes les unes des autres (Cauchy 1821, [TQl P- 4]). 

Next, Cauchy exploits his primary notion to evoke his kinetic concept 
of limit in the following terms: 

Lorsque les valeurs successivement attributes a une meme 
variable s'approchent indefiniment d'une valeur fixe, de 
maniere a finir par en differer par aussi peu qu l'on 
voudra, cette derniere est appelee la limite de toutes 
les autres (ibid.). 

Finally, Cauchy proceeds to define infinitesimals in the following terms: 

Lorsque les valeurs numeriques successives d'une meme 
variable decroissent indefiniment, de maniere a s'abaisser 
au-dessous de tout nombre donne, cette variable devient 
ce qu'on nomme un infiniment petit ou une quantite in- 
finiment petite. Une variable de cette espece a zero pour 
limite (ibid.). 



Translation: "We know that Cauchy dchncd the concept of infinitely small by 
means of the concept of limit, which played the primary role" (Gilain). 
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Thus, Cauchy denned both infinitesimals and limits in terms of variable 
quantities. Neither is the limit concept primary, nor are infinitesimals 
defined in terms of limits, contrary to Gilain's claims. 

2.4. Hawking and Gray. In his 2007 anthology, S. Hawking repro- 
duces Cauchy's infinitesimal definition of continuity on page 639: 

the function f(x) remains continuous with respect to x 
between the given bounds^] if, between these bounds, 
an infinitely small increment in the variable always pro- 
duces an infinitely small increment in the function itself 
(Hawking, [331 P- 639]). 

But Hawking also claims on the same page 639, in a comic non-sequitur, 
that Cauchy "was particularly concerned to banish infinitesimals" . 

In a similar vein, historian J. Gray lists continuity among concepts 
Cauchy allegedly defined 

using careful, if not altogether unambiguous, limiting 
arguments (Gray 2008, [321 P- 62]) [emphasis added- 
authors] . 

But in point of fact limits appear in Cauchy's definition only in the 
sense of the extremities (endpoints, bounds) of the domain of defini- 
tionEI Contrary to Gray's claim, the arguments Cauchy used to define 
continuity were not "limiting" but rather infinitesimal. 

2.5. Laugwitz replies. Dieudonne's query mentioned in Subsection l2.2l 
is answered by LaugwitzJ^ who argues that objects such as delta func- 
tions (and their potential applications) disappeared from the literature 
due to the elimination of infinitesimals, in whose absence they could 
not be sustained. Laugwitz notes that 

Cauchy's use of delta function methods in Fourier anal- 
ysis and in the summation of divergent integrals enables 
us to analyze the change of his attitude toward infinites- 
imals (Laugwitz 1989, [521 p. 232]). 

3. From Cauchy to Dirac 

A function of the type generally attributed to Dirac (1902-1984) 
was specifically described in 1827 by Cauchy in terms of infinitesimals. 
More specifically, Cauchy uses a unit-impulse, infinitely tall, infinitely 

6 The term bounds is Hawking's translation of Cauchy's limites. 
7 See footnote [6] 

8 Laugwitz reports having been influenced by Freudenthal's allusion to Cauchy's 
work on delta functions in [271 p. 136]. 
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narrow delta function, as an integral kernel. Thus, in 1827, Cauchy 
used infinitesimals in his definition of a "Dirac" delta function [TTJ, 
p. 188]. Here Cauchy uses infinitesimals a and e, where a is, in modern 
terms, the "scale parameter" of the "Cauchy distribution", whereas e 
gives the size of the interval of integration. Cauchy wrote [TTJ p. 188]: 

Moreover one finds, denoting by a, e two infinitely small 
numbers, 

1 f a+e . i a da 7r . . . 

2jL fM a* + (/-ap = 2 f < a > ^ 

(Cauchy's 1815-1827 text is analyzed in more detail in Section H@ A 
formula equivalent to (13. ip was proposed by Dirac a century laterj^ 
The expression 

(3.2) 



a 2 + (fi — a) 2 

(for real a) is known as the Cauchy distribution in probability theory. 
Here Cauchy specifies a function which meets the criteria as set forth by 
Dirac a century laterJHI In modern terminology, the function is called 
the probability density function, and the parameter a is referred to as 
the scale parameter. Cauchy integrates F against the kernel (13. 2p as 
in (13. ip so as to extract the value of F at the point a, exploiting the 
characteristic property of a delta function^ Thus, a Cauchy distri- 
bution with an infinitesimal scale parameter produces an entity with 
Dirac-delta function behavior, exploited by Cauchy already in 1827. 

Laugwitz notes that Cauchy's formula (13. ip is satisfied when e > ct 1//2 
(as well as for all positive real values of e > 0). Today we would write 
the integral in the formally more elegant fashion as an integral over an 
infinite domainj^ 

1 f°° . a da it „. s 

but Cauchy, ever the practical man rather than a formalist, saw no 
reason to bother with subjunctive integration over domains where the 
function is perceptually indistinguishable from zero anyway. 
Furthermore, Laugwitz documents Cauchy's use of 



9 See Laugwitz [H p. 230]. 
10 See main text at footnote [3l 

11 See footnote Q] for a discussion of Dirac's criterion. 

12 From the modern viewpoint, formula (|3.ip holds up to an infinitesimal error; 
thus, the right hand side is the standard part of the left hand side, see Section [5] 
13 Modulo the proviso of footnote [T2] 
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an explicit delta function not contained under an inte- 
gral sign (Laugwitz 1989, P- 231]) [emphasis added— 
authors] , 

contrary to a claim in Dieudonne's texto An occurrence of a delta 
function not under an integral sign in Cauchy's work is discussed in 
Section [5j 

In his 1908 book Elementary Mathematics from an Advanced Stand- 
point, Felix Klein points out that the 

naive [perceptual] methods always rise to unconscious 
importance whenever in mathematical physics, mechan- 
ics, or differential geometry a preliminary theorem is to 
be set up. You all know that they are very serviceable 
then (Klein [5Ql p. 211]). 

These remarks apply perfectly well to Cauchy's 5, as well. On the other 
hand, Klein is perfectly aware of the situation on the ground: 

To be sure, the pure mathematician is not sparing of his 
scorn on these occasions. When I was a student it was 
said that the differential, for a physicist, was a piece of 
brass which he treated as he did the rest of his apparatus 
(ibid). 

Additional remarks by Klein, showing the importance he attached to 
this vital connection, appear in Section [10J 

4. Cauchy's Note XVIII 

Cauchy's lengthy work Theorie de la propagation des ondes a la sur- 
face d'un fluide pesant d'une profondeur indefinie [11] was written in 
1815. The manuscript was published in 1827 as a 300-page text, with 
a number of additional Notes at the end. The running shortened title 
used throughout is Memoire sur la theorie des ondes. 

Note XVIII, entitled Sur les integrales definies singulieres et les 
valeurs principales des integrales indetermine.es, starts on page 288. 
Cauchy recalls the notion of a singular definite integral, describing it 
in terms of an integrand that becomes "infinite or indeterminate" . He 
continues by denoting by e an "infinitely small number"0 and by a, h 



14 See discussion of Dieudonne's claim in footnote [20l 

15 Cauchy's use of term "number", rather than "quantity", in this context is 
interesting. In general he refers to infinitesimals as "quantities" . 
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two positive constants. On page 289, after choosing an additional "in- 
finitely small number" a, Cauchy writes down the integral 

1 f a+e . > a da 7r . . 

(already reproduced as formula (13. ip above), which he denotes by (2). 
Cauchy proceeds to point out that, since the integrand of his equa- 
tion (2) is sensiblement egale a zero [essentially equal to zero] for all 
values of /i qui ne sont pas tres rapprochees de a [which are not too 
close to a] , it follows that the integrals appearing in his earlier Note VI 
reduce to singular integrals determined by his equation (2). 

Note XVIII then proceeds to discuss principal values and to offer 
alternative derivations of a number of earlier results, and is concluded 
on page 299. 

5. Cauchy's 1827 Memoire 

An additional occurrence of a delta function occurs in Cauchy's 
brief 1827 text Memoire sur les developpements des fonctions en series 
periodiques [12] . The text contains an (a priori doomed) attempt to 
prove the convergence of Fourier series under the sole assumption of 
continuity. What concerns us here is his, correct, use of infinitesimals 
at a certain stage in the argument. Cauchy opens his memoire with 
a discussion of the importance of what are known today as Fourier 
series, in "a large number of problems of mathematical physics" [121 
p. 12]. On page 13, Cauchy denotes by e un nombre infiniment petit 
[an infinitely small number], lets 9 = 1 — 6, and lets x be between 
and a = 2n. On page 14, he points out that the expression 

(his notation is slightly different) "will be essentially zero, except when \x 
differs very little from x" . Note that the expression (15. ip appearing on 
Cauchy's page 14, does not occur under the integral sign (it was ex- 
ploited as a kernel in the last formula on the previous page 13). 

Cauchy then sets /i = x + iw and concludes that the integral will be 
essentially reduced tco 



/(*) ■ / 7—— + 1 dw = 2 *f( x )' 

1 + iw 1 — iw 



16 



Cauchy wrote a where we wrote 2ir. 
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6. Triumvirate history 

Studies seeking to document continuity between Cauchy's infinites- 
imals and modern set-theoretic implementations of infinitesimals have 
not always been viewed sympathetically by commentators. 

In 1949, C. Boyer hagiographically described G. Cantor, R. Dedekind, 
and K. Weierstrass as "the great triumvirate" [H p. 298]. The "tri- 
umvirate" historian tends to view the history of mathematics as an in- 
eluctable march toward the radiant future of Weierstrassian epsilontics. 
Such a stance leaves little room for a sympathetic view of a continuity 
between infinitesimals as practiced prior to the triumvirate, on the one 
hand, and 20th century set-theoretic implementations of infinitesimals, 
on the other. J. Dauben wrote: 

In bringing historians of the calculus to task, Robinson 
was particularly critical of Carl Boyer's The Concepts of 
the Calculus. The history of mathematics was on shaky 
ground, Robinson felt, if it chose to pass judgment on 
earlier theories based upon currently fashionable prej- 
udices. Nonstandard analysis cast a new light on the 
history of the calculus, and Robinson was interested to 
see how it might appear if reexamined without assuming 
that infinitesimals were wrongheaded or at all lacking in 
rigor (Dauben 1995, [13 p. 349]). 

Dauben appears sympathetic to Robinson's interest in re-examining 
the history of infinitesimals. It is all the more surprising therefore to 
read a text Dauben authored a few years earlier. Here Dauben presents 
a list of authors, including D. Laugwitz, who "have used nonstandard 
analysis to rehabilitate or 'vindicate' earlier infinitesimalists" , and con- 
cludes: 

Leibniz, Euler, and Cauchy [. . . ] had, in the views 
of some commentators, "Robinsonian" nonstandard in- 
finitesimals in mind from the beginning. The most de- 
tailed and methodically [sic] sophisticated of such treat- 
ments to date is that provided by Imre Lakatos; in what 
follows, it is his analysis of Cauchy that is emphasized 
(Dauben 1988, [M p. 179]). 

However, Lakatos's treatment was certainly not "the most detailed and 
methodically sophisticated" one by the time Dauben's text appeared in 
1988. Thus, in 1987, Laugwitz had published a detailed scholarly study 
of Cauchy in Historia Mathematica (Laugwitz [51]). Laugwitz's text in 
Historia Mathematica appears to be the published version of his 1985 
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preprint Cauchy and infinitesimals. Laugwitz's 1985 preprint does ap- 
pear in Dauben's bibliography (Dauben, 1988 [16, p. 199]), indicating 
that Dauben was familiar with it. It is odd to suggest, as Dauben seems 
to, that a scholarly study published in Historia Mathematica would 
countenance a view that Cauchy could have had "'Robinsonian' non- 
standard infinitesimals in mind from the beginning". Surely Dauben 
has committed a strawman fallacy here. 

Rather, Lakatos, Laugwitz, Cutland et al. [15] . Brating (2007, [9]) 
and others have argued that infinitesimals as employed by Cauchy have 
found set-theoretic implementations in the framework of modern the- 
ories of infinitesimals, just as Kanovei had done for Euler in 1988 [55] . 
The existence of such implementations indicates that the historical in- 
finitesimals were less prone to contradiction than has been routinely 
maintained by triumvirate historians, who invariably cite Berkeley's 
flawed empiricist critique. The issue is dealt with in more detail by 
Katz & Katz [II]. [59]. pj. pj: Blaszczyk et al. [5J; Borovik et al. [7]; 
Katz & Leichtnam and Katz & Sherry [351 HB]. 

What criteria can we employ to evaluate the achievements of modern 
theories of infinitesimals? Remarkably, just such a criterion was explic- 
itly provided by both Felix Klein and Abraham Fraenkel, as discussed 
in Section [7J 

7. Klein-Fraenkel criterion 

In 1908, Klein formulated a criterion of what it would take for a 
theory of infinitesimals to be successful. Namely, one must be able to 
prove a mean value theorem for arbitrary intervals, including infinites- 
imal ones: 

The question naturally arises whether [. . . ] it would 
be possible to modify the traditional foundations of in- 
finitesimal calculus, so as to include actually infinitely 
small quantities in a way that would satisfy modern de- 
mands as to rigor; in other words, to construct a non- 
Archimedean system. The first and chief problem of this 
analysis would be to prove the mean-value theorem 

f(x + h)-f(x) = h-f'(x + $h) 

from the assumed axioms. I will not say that progress 
in this direction is impossible, but it is true that none of 
the investigators have achieved anything positive (Klein 
1908, [501 P- 219]). 
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In 1928, A. Fraenkel (251 PP- 116-117] formulated a similar requirement 
in terms of the mean value theorem. Such a Klein-Fraenkel criterion is 
satisfied by the Hewitt-Los-Robinson theory. Indeed, the mean value 
theorem is true for an arbitrary hyperreal interval by the transfer prin- 
ciple. Fraenkel's opinion of Robinson's theory is on record: 

my former student Abraham Robinson had succeeded in 
saving the honour of infinitesimals - although in quite a 
different way than CohenE] and his school had imagined 
(Fraenkel 1967, [261 p. 107]). 

8. Set-theoretic implementation of infinitesimals 

In Section [61 we clarified the role of modern theories of infinitesimals 
in interpreting the work of historical infinitesimalists. Here we present 
some details of a particular set-theoretic implementation of infinitesi- 
mals as developed through the work of Hewitt, Los, and Robinson. For 
an alternative approach to infinitesimals, see P. Giordano 

In 1961, Robinson [M] constructed an infinitesimal-enriched con- 
tinuum, suitable for use in calculus, analysis, and elsewhere, based 
on earlier work by E. Hewitt [35], J. Los [56], and others. In 1962, 
W. Luxemburg [58] popularized a presentation of Robinson's theory in 
terms of the ultrapower construction^ in the mainstream foundational 
framework of the Zermelo-Fraenkel set theory with the axiom of choice. 
Namely, the hyperreal field is the quotient of the collection of arbitrary 
sequences, where a sequence 



converging to zero generates an infinitesimal. Arithmetic operations 
are defined at the level of representing sequences; e.g., addition and 
multiplication are defined term-by-term. 

To motivate the construction of the hyperreals, we will start with 
the ring Q N of sequences of rational numbers. Let Cq C Q n denote the 
subring consisting of Cauchy sequences. The reals are by definition the 



The reference is to Hermann Cohen (1842-1918), whose fascination with in- 
finitesimals elicited fierce criticism by both Cantor and B. Russell. For an analysis 
of Russell's non-sequiturs, see Ehrlich [5D] and Katz & Sherry [35] . 

18 Note that both the term "hyper-real", and an ultrapower construction of a 
hyperreal field, are due to E. Hewitt in 1948, see [351 P- 74]. Luxemburg [58] clar- 
ified its relation to the competing construction of Schmieden and Laugwitz [68], 
also based on sequences, but using the ideal J- ez - Dauben |17[ p. 395] mistak- 
enly suggests that it was Luxemburg who initiated the ultrapower approach to the 
hyperreals using free ultrafilters. 



(U!,U 2 ,U 3 , . . .) 
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quotient fielcQ 

R := Cq/J^um, (8.2) 

where J-^ contains all null sequences. An infinitesimal-enriched exten- 
sion of Q may be obtained by modifying (18.21) as follows. We consider a 
subring T ez C Jyh* of sequences that are "eventually zero", i.e., vanish 
at all but finitely many places. Then the quotient C^jT ez naturally 
surjects onto R = Cq/Jvh^. The elements in the kernel of the sur- 
jection C^jT ez —> R are prototypes of infinitesimals. Note that the 
quotient Cq/J-^ is not a field, as J-" e2 is not a maximal ideal. It is more 
convenient to describe the modified construction using the ring R N 
rather than Cq. 

We therefore redefine T ez to be the ring of sequences in R N that 
eventually vanish, and choose a maximal proper ideal hA so that we 
have 

T ez C M C R N . (8.3) 

Then the quotient HR := R N /.M is a hyperreal field. The foundational 
material needed to ensure the existence of a maximal ideal M. satisf- 
ing (I8.3p is weaker than the axiom of choice. This concludes the con- 
struction of a hyperreal field 1IR in the traditional foundational frame- 
work, Zermelo-Fraenkel set theory with the axiom of choice (ZFC). 

Let I C HR be the subring consisting of infinitesimal elements (i.e., 
elements e such that |e| < - for all n EN). Denote by J -1 the set of 
inverses of nonzero elements of /. The complement HR\/ _1 consists of 
all the finite (sometimes called limited) hyperreals. Constant sequences 
provide an inclusion R C HR. Every element x E HR \ I~ l is infinitely 
close to some real number x$ E R. The standard part function, denoted 
"st" , associates to every finite hyperreal, the unique real infinitely close 
to it: 

st : 1IR \ r 1 ->• R, x^x . 

If x happens to be represented by a Cauchy sequence (x n : n E N), so 
that x = [(x n )], then the standard part can be expressed in terms of 
the ordinary limit: 

st(x) = lim x n . 

n— ¥oo 

More advanced properties of the hyperreals such as saturation were 
proved later (see Keisler [19] for a historical outline). A helpful "semi- 
colon" notation for presenting an extended decimal expansion of a hy- 
perreal was described by A. H. Lightstone [51] . See also P. Roquette [67] 
for infinitesimal reminiscences. A discussion of infinitesimal optics is in 



Such a construction is usually attributed to Cantor, and is actually due to 
Meray 1869, [SS] who published three years earlier than E. Heine. 
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K. Stroyan [73], J. Keisler jH] and others. Applications of infinitesimal- 
enriched continua range from aid in teaching calculus [211 E3 EH] to the 
Bolzmann equation (see L. Arkeryd [21 13]) and mathematical physics 
(see Albeverio et al. [1]). Edward Nelson [60] in 1977 proposed an 
alternative to ZFC which is a richer (more stratified) axiomatisation 
for set theory, called Internal Set Theory (1ST), more congenial to in- 
finitesimals than ZFC. The traditional construction of the reals out of 
Cauchy sequences can be factored through the hyperreals (see Gior- 
dano et al. |30j). The hyperreals can be constructed out of integers 
(see Borovik et al. [6]). 

9. Heaviside function 

Having clarified the connection between historical infinitesimals and 
their modern set-theoretic implementations in Sections M and [HI we 
return to delta functions and their integrals. 

Dieudonne's review of Liitzen's book is assorted with the habitual, 
and near-ritual on the part of some mathematicians, expression of dis- 
dain for physicists: 

However, a function such as the Heaviside function on M, 
equal to 1 for x > and to for x < 0, has no weak 
derivative, in spite of its very mild discontinuity; at 
least this is what the mathematicians would say, but 
physicists thought otherwise, since for them there was 
a "derivative" S, the Dirac "delta function" (Dieudonne 
[TS1 p. 377]) [the quotation marks are Dieudonne's— 
authors]. 

Dieudonne then proceeds to make the following remarkable claims: 

Of course, there was before 1936 no reasonable mathe- 
matical definition of these objects; but it is characteris- 
tic that they were never used in bona fide computations 
except under the integral sign^ giving formula^ such 

as 

J 5(x - a)f(x) = f(a). 

Are Dieudonne's claims accurate? Dieudonne's claim that, before 1936, 
delta functions occurred only under the integral sign, is contradicted 

20 We are not certain what bona fide calculations are exactly, but at any rate 
Dieudonne's claim that delta functions were never used except under an integral 
sign, would be inaccurate; see main text at footnote 1141 

21 Here we have simplified Dieudonne's formula in [T5J p. 377], by restricting to 
the special case n = 0. 
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Figure 9.1. Heaviside function 



by Cauchy's use of a delta function not contained under an integral 
sign, over a hundred years earlier (see Section [5]). 

Are the physicists so far off the mark in speaking of the delta function 
as the derivative of the Heaviside function? Is it really true that there 
was no reasonable mathematical definition before 1936, as Dieudonne 
claims? In fact, Cauchy had a reasonable mathematical definition of 
a delta function, though of course both set theory and set-theoretic 
implementations of his ideas were still decades away. 

Thus, consider the zigzag Z C M? in the (x, y)-plane given by the 
union 



Z = (R_ x {-!}) U ({0} x [-|f]) U (R + x {+=}) , (9.1) 



thought of as the physicist's Heaviside function, see Figure ED 

Now consider the graph of arctan(x) in the (x, y)-plane, and com- 
press it toward the y-axis by means of a sequence of functions arctan(nx), 
or arctan(a;/a;) where a — K Their derivatives F a (x) satisfy 



by the fundamental theorem of calculus. In an infinitesimal-enriched 
continuum (see Section [8]), we can assign an infinitesimal value to a. 
Then the graph 



of arctan(x/a;) is "appreciably indistinguishable" from the plane zigzag 
Z C 1R 2 of formula f l9.ip P^ Instead of attempting to differentiate the 

22 In modern notation, this relation would be expressed by the fact that the 
standard part "st" of the graph T arctan ( x / a ) coincides with the zigzag: 




r 



St (r arcta n(a;/a)) — Z C M 2 . 
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zigzag itself as physicists are alleged to do, we differentiate its infini- 
tesimal approximation arctan(x/a), and note that we obtain precisely 
Cauchy's delta function appearing in formula (I3.ip . against which F is 
integratedEl 

10. Klein's remarks on physics 

As in the case of Cauchy's delta function, infinitesimals provide an 
intuitive point of entry to key phenomena in both mathematics and 
physics. In a similar vein, Klein discussed infinitesimal oscillations of 
the pendulum in his Elementary Mathematics from an Advanced Stand- 
point. Klein presents the derivation of the pendulum law by pointing 
out that 

it follows from the fundamental laws of mechanics that 
the motion of the pendulum is determined by the equa- 
tion § = f sin0 (Klein [5Ql p. 187]). 

Here g is the gravitational constant, while I is the length of the thread 
by which the pendulum is suspended, and <fi is the angle of deviation 
from the normal. Klein continues: 

For small amplitudes we may replace sin </> by <p without 
serious error. This gives for the so called infinitely small 
oscillation of the pendulum ^| = %<j> (ibid.). 

Klein proceeds to write down the general solution (f> = C cos «/| (t— to), 
and points out that the duration of a complete oscillation, i.e., the pe- 
riod T = 2ir^y£/g, is independent of the amplitude C. Reflecting upon 
the teaching practices at the time, Klein muses over the incongruity of 

the curious phenomenon that one and the same teacher, 
during one hour, the one devoted to mathematics, makes 
the very highest demands as to the logical exactness 
of all conclusions. In his judgment [. . . ] his demands 
are not satisfied by the infinitesimal calculus. In the 
next hour, however, that devoted to physics, he accepts 
the most questionable conclusions and makes the most 
daring applications of infinitesimals (ibid.). 

Here the function arctan(a;/a) is the mathematical counterpart of the physicist's 
Heaviside function. Of course, Cauchy did not have the notion of a standard part 
function (see Section [5]), to express the idea that an error term is infinitesimal. In- 
stead, he used the expression sensiblement nulle (sensibly nothing), see |52[ p. 231]. 

23 Takeuti [74], Giorello [31], Lightstone and Wong [55], and later Todorov [75] 
and Peraire |63] have developed this theme further. Yamashita |76] provides a 
bibliography of articles dealing with hyperreal delta functions. 
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Klein's lament concerning the existence of such artificial boundaries is 
echoed by Novikov [BT] . 

11. Charles Sanders Peirce's framework 

The customary set-theoretic framework that has become the reflexive 
litmus test of mathematical rigor in most fields of modern mathematics 
(with the possible exception of the field of mathematical logic) makes 
it difficult to analyze Cauchy's use of infinitesimals, and to evaluate its 
significance. We will therefore use a conceptual framework proposed 
by C. S. Peirce in 1897, in the context of his analysis of the concept of 
continuity and continuum, which, as he felt at the time, is composed 
of infinitesimal parts, see [33J p. 103]. Peirce identified three stages in 
creating a novel concept: 

there are three grades of clearness in our apprehensions 
of the meanings of words. The first consists in the con- 
nexion of the word with familiar experience. . . . The sec- 
ond grade consists in the abstract definition, depending 
upon an analysis of just what it is that makes the word 
applicable. . . . The third grade of clearness consists in 
such a representation of the idea that fruitful reasoning 
can be made to turn upon it, and that it can be applied 
to the resolution of difficult practical problems [52] (see 
Havenel 2008, [331 p. 87]). 

The "three grades" can therefore be summarized as 

(1) familiarity through experience; 

(2) abstract definition aimed at applications; 

(3) fruitful reasoning "made to turn" upon it, with applications. 

To apply Peirce's framework to Cauchy's notion of infinitesimal, we 
note that grade (1) is captured in Cauchy's description of continuity 
of a function in terms of "varying by imperceptible degrees". Such a 
turn of phrase occurs both in his letter to Coriolis (see Cauchy 1837, 
[13]), and in his 1853 text [H P- 35] @ At Grade (2), Cauchy describes 
infinitesimals as generated by null sequences (see [9]), and defines con- 
tinuity in terms of an infinitesimal x-increment resulting in an infin- 
itesimal change in y. Finally, at stage (3), Cauchy fruitfully applies 
the crystallized notion of infinitesimal both in Fourier analysis and in 
evaluation of singular integrals, by means of a "Dirac" delta function 

24 Note that both Cauchy's original French "par degres insensibles" , and its cor- 
rect English translation "by imperceptible degrees", are etymologically related to 
sensory perception. 
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defined in terms of a (Cauchy) distribution with an infinitesimal scaling 
parameter. 

From the viewpoint of the Peircian framework, and contrary to 
Dieudonne's claim, Cauchy did have a reasonable mathematical def- 
inition of a, Dirac, delta function, though both set-theory and set- 
theoretic implementations of infinitesimal-enriched continua in which 
Cauchy's definition could be made operative were still decades away. 
Cauchy's definition could be compared to the definitions of the fun- 
damental concepts of infinitesimal calculus furnished by Newton and 
Leibniz. The founders of the calculus, similarly to Cauchy, lacked a set- 
theoretic formalisation of a continuum, and yet are (rightfully) given 
credit for the fundamental concepts they introduced. Cauchy exploited 
infinitesimals both in his definition of continuity in 1821, and in his def- 
inition of a notion closely related to uniform convergence in 1853 (see 
Katz Sz Katz [39] and Blaszczyk et al. [5]). The centrality of infinites- 
imals in Cauchy's approach to analysis is further clarified through his 
use thereof in defining "Dirac" delta functions. 
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ism that represents both process and concept. Procept and concept 
image were declared two of the five major contributions to mathemat- 
ics education by founder president, Efraim Fischbein, reviewing the 
first 25 years of the International Group for the Psychology of Math- 
ematics Education. He was the most quoted author in the review 
of the group's first thirty years. In 2004, introduced the theoretical 
framework of Three Worlds of Mathematics, based on human embod- 
iment (perception and action), operational symbolism and axiomatic 
formalism together with an analysis of the supportive aspects that en- 
courage generalization and problematic aspects that impede progress. 
He has cooperated with Mikhail Katz, applying the three- world frame- 
work to the work of Cauchy and subsequent developments in formal- 
ism and the use of infinitesimals. His papers are all available from 
|http: / /www.warwick.ac.uk/staff/David.TaiTj 
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